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Abstract 


The inclusion problem for the class of monadic recursion schemes is 
shown to be undecidable. The proof illustrates the close relationship 
between monadic recursion schemes and deterministic pushdown automata. 
The proof is extended to show that both the weak equivalence problem 
for the class of monadic recursion schemes and the weak equivalence 
problem for the class of free schemes without identity are undecidable. 



Schemes can be viewed as abstract models for computer programs. 

They allow us to study aspects of a computation that are independent of 
the actual functions, variables and predicates involved. In this paper 
we shall be concerned with one particular class of schemes--monadic 
recursion schemes. 

In a monadic program scheme there is exactly one variable x, a 
set of unary functions (fQ,...,f m ) that assign values to the variable 
x, and a set of unary predicates -(Pq» •••♦?) that determine the flow 
of computation. In addition, there is a set of function variables 
(Fq,...,Fj_) that are defined below. 

Let a term be defined in the usual way as constructed from functions 
and function variables applied to the variable x, e.g., 

f l (F 2 (F 4 (f 0 (x)))) » x » F l (x) 

A conditional term is any expression of the form 

if p^(x) then else 

where T^, are terms or conditional terms, A function variable 

definition is : 

F. T, 

where T is any term or conditional term. Since x is the only 
variable, we will abbreviate 
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F 1 (x) «- if p 2 (x) 

then 

f 1 (F 2 (x)) else x 

as 

F^ *■ if p 2 then 

f F- 

12 

else ID, 


where ID is the identity function. 

We can now formally define a monadic recursion scheme S as a 
5-tuple S = (V, 2>»Fg) , where 

V = a finite set of function variables 
&= a finite set of functions 
SP - a finite set of predicates 

3) - a finite set of function variable definitions (exactly one 
for each elememt in V) 

Fq e V the distinguished initial function variable 

Unless otherwise noted "scheme" shall mean "monadic recursion scheme". 

If we assign a value to the variable x and associate actual functions 
and predicates with the scheme (e.g., f^(x) ^ , p 2 (x) ^ x “ 0) , then 

the scheme can be looked upon as an executable program. Such associations 
are called interpretations . Formally, an interpretation I has domain D 
of possible storage values (for the variable x) , and a distinguished 
element e D used as the initial value of x (or, rather, the input) . 

Since the scheme is monadic, this is the only treatment of variables 
necessary. For each function f, there is a total function 1(f): D D, 
and for each predicate p, a total function I (p) : D -* {T,F}. So, for 
any given interpretation I, the scheme S can be evaluated in the 
normal sense by applying the initial function variable to input d^. 

The computation either terminates, yielding a value called val^(S), or 
it diverges and val^(S) is undefined. 
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Given two schemes S and S', we say that S is less defined than 
S' (SES 1 ) iff for every interpretation I, whenever val T (S) is 
defined, then val^(S') is also defined and valj(S) = valj(S'). Because 
of the obvious difficulty with proving properties over all interpretations, 
we define a more restrictive type of interpretation. An interpretation 
will be called free if for variable x, I(x) = e (i.e., the empty word), 
and 1(f) (a) = fa, where this is just the concatenation of the function 
symbol f to the string a e Dom(I). Note the resemblance to the 
Herbrand Universe, In fact, such interpretations are often called Herbrand 
interpretations in the literature. It is important to realize that we 
have not restricted the predicates; because of this, we can obtain an 
infinite number of free interpretations for any scheme. This notion of 
free interpretation now yields the following useful result: 

Lemma : Given any two schemes S and S’, S^S’ iff for all free 

interpretations I, whenever valj(S) is defined, then valj(S') is 
defined and val^(S) = vales'). 

Proof . This is similar to the result on equivalence in [5]. 

To show that a problem is unsolvable, it ‘Is often convenient to use 

the unsolvability of the Post Correspondence Problem (PCP) . The Post 

Correspondence Problem is defined as follows: Let I be a finite set 

containing at least two elements, and let ^ be a non-empty sequence 

+ 

pf 2- tuples of strings in £ . For example, 
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ST- (x 1 ,y 1 ),...,( V y n ) 

where foir i = 1. . . . ,n, x^.y^ s £ . 

This is an instance of the PCP. The sequence of indices i^ t ,..,i with 

t > 1 is a solution to this instance, of the PCP if x, . ,.x. = y. . ,.y . 

It X 1 i t 

It is well known that the PCP is undecidable. 

In [6], Paterson states that the question of whether or not the 
inclusion problem (S ES 1 ) for monadic recursion schemes is decidable is 
open. The following theorem shows that this problem is undecidable. First 
we shall sketch a proof of the known result that the inclusion problem for 
languages accepted by deterministic pushdown automata (dpda) is undecidable. 
This construction [6] will give an indication as to how we will later 
prove the main theorem of this paper. 

Let £ = {a,b> and let £P = (x^.y^ , . . . , (x^.y^) , where for 
i = l,.«.,n, x ^»y^ £ Encode the indices l,...,n, as symbols 

f^,...»f n , respectively. Define the following two languages ^ 1^2 — 

{ a » b t c 5 $ , f f 2» • * * i f 

R R l 

L, * {f ...f cx. ...x, $ | t > 1 and i.,.., > t. are indices 

1 ±1 i t 1 t 1 1 ‘ 1 1 

from 1 to n) 

L„ = {f. ...f, cw$ I t > 1 and i 1 ,..,,i are indices from 

2 i fc I t 

. R , 

and w f y , ,.y. / 


1 to . n, 
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Both and L ^ can be accepted by dpda's, and ^2 — ^2 there does 

not exist a solution to the PCP for f/\ Thus, the inclusion problem for 
dpda's is undecidable. 

Theorem : The inclusion problem for monadic recursion schemes is undecidable 

Proof : Let I = {a,b} and let = (x^,y^) , . ♦ . , (x n ,y n ) where 

for ± = 1 , , , , ,n, x i ,y 1 e I + . 

* * 

Let r = {A,B}, and define a homomorphism h: £ ■* T determined by 

h(a) = A, h(b) = B. 

^ . A A . A jl> 

Let r = and define the function g: E IT as follows: 


if zw e I + where 

z e (a,b) 

g(zw) = (A h(w) , 

if z ~ a 

t B h(w) , 

if z — b 


We will now define two schemes S, S' such that SES* iff there does 
not exist a solution to the PCP for We will see that SES' => 

(valj.(S) | I is a free interpretation for S} E 
{vales') | X is a free interpretation for S'}. 

Let S = (V, & , % S' t Fq) t where 

V = {F 0 ,F 1 ,...,F n }Ur U{X,U}, 

{a,b,c,$} u { f x f n ), 

'( Ti * • • • »T„} > p, ,p 4 } , and is defined as follows: 

1 n a D y 

(The comments that follow point out the similarities to acceptance by 
a dpda) : 
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Fq ■*- XF^ \ Hark the end of a computation. This 

> is similar to placing a marker on the 
) bottom of the pushdown store in a dpda. 


if 

«- if q 2 


F if q 
n ^n 


then h(x^)F^f^ 
then h(x 2 )F^f 2 


then h(x ) Ft f 
n In 


else F 
else F 


else c 



h(x^) is a string that encodes x^. 

Note how this is like pushing the 
string x^ onto the pushdown store 

when an index (f^,) is read from the 

input string. The operation c indicates 
the end of "reading" indices. 


A <- 

if 

P a 

then 

a 

else 

U 

B «- 

if 

P a 

then 

U 

else 

if 



p b 

then 

b 

else 

U 


X *■ if p then U else if 
r a 

p, then U else if 

D 


p* then $ else U 


Recall that Mxp is a string in {a,b) + . 

We can view predicates p^, p^> and p$ as 

"testing if the symbol read from the input 

string is an a, b, or $ , respectively." 

Thus, A' is undefined ("rejects") unless 

D is true; B is undefined unless p, is 
a b 

true. 


\ X is the leftmost function variable, and 
I it is undefined unless p^, are false 

/ and is true. That is, we have reached 

/ "the end of the string". 


U U 


This is a loop, so that whenever U is 
encountered in a computation, the value 
of the scheme is undefined for that 
interpretation. 


is clear from the definition of S that 


(valjCS) | I is a free interpretation for s) = 

: {$x. ,x. cf ' V. .f . | for j=l,...,t, 1 < i < n}U{$c} 

X 1 X t X t \ 1 


It 



*y> * 1 > * 1 > 
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Vie could have written a "simpler" scheme that would have also produced 
the same set of values by using the following definitions of 2 ) : 


F^ •<- if q^ then x^F^f^ else F£ 


F If q then x F, f else c 
n n n 1 n 


It will become clear later why we have chosen not to proceed in this manner. 
Define scheme S' as follows: S’ = (V 1 , 2)’ ,F’) , where V’ = 

{ Fi FnUrUrUfT.T.E.lDUt^ F n ), and 2 > 1 is defined as follows: 


•*- if q 1 then ?(.V 1 ) F 1 f 1 else F 2 ' 
if q 2 then S(y 2 /' F ^ f 2 e ^- se F 3 


P' ■<- if q then g(y )F,f else $c 
n n ■ n i n 


^ if then h(y else 

2 + if ^2 then h(y 2 ) F q F 2 e ^ se ^3 


if q then h(v )F,f else c 
n 'n 1 n 


The same comments apply as for - scheme 

S, except that the leftmost function 

variable is noxv T marked with a Also, 

F’ has $c as the else clause instead of 
n 

c, since we do not have a leftmost 
indicator X, 


same comments as for scheme S 
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A *■ if p then 
r a 

p^ then 
p$ then 


a else if 
Tb else if 
ID else U 



If p is true ("a is read from the 

input string"), then continue computing 
for the remaining function variables* 
Otherwise, if is true, then we 


"accept the string by reading until 
$ is reached" (via function variable 
T) . However, if p,s is true, then the 


"string is shorter than 


■Yi . 
x t 


so 


accept by reducing to ID. All remaining 
function variables also reduce to ID 
until the leftmost is encountered (A or 
B) , which then computes $, 


B •*- 

if 

t> 

■ a 

then 

Ta 

else 

if 



Pb 

then 

b 

else 

if 



P $ 

then 

ID 

else 

U 

A 

A -e 

if 

P a 

then 

Ea 

else 

if 



p b 

then 

Tb 

else 

if 



Pg 

then 

$ 

else 

U 


dual of A above 


This .is the leftmost function variable 
so if p is true, then we must check 

via function variable E whether the 
"next symbol in the input string is 
the endmarker $, indicating the end 
of the string." Otherwise, "accept 
the remaining string that ends in $." 


A A 


B «- 

if 

p a 

then 

Ta 

else 

if 



p b 

then 

Eb 

else 

if 



P $ 

then 

$ 

else 

U 

T -e- 

if 

p 

then 

Ta 

else 

if 



a 







fp b 

then 

Tb 

else 

if 



P* 

then 

ID 

else 

IT 




dual of A above 


This acts like a state in a dpda that 
reads until the endmarker $; when 
p$ is finally true, the ID function 

causes all remaining function variables 
to also reduce to ID until the leftmost 

. A A 

is encountered (A or B) , which then 
computes $. 
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Similar to T above, but T is 
encountered only when no other 
function variable remains to be 
computed. Hence, when p<, is true, 

the function $ i3 applied immedi- 
ately. 


This basically checks for the end- 
marker $. p. true indicates that 

V 

we have "read a string 

$y. ...Vj cfi . ..f,. , so reject." 

X 1 t t *1 „ 

Otherwise, "accept" via T. 

U U | Loop , 


It is implicit in the comments above that 

(values') | I is a free interpretation for S'} = 

{ $wcf , . ,,f. | for j=l,...,t, 1 < i. < n, and w e 

1 t X 1 “ 1 

w i* y, . . ,y. } U{$c} 

X 1 X t 

The theorem follows from the undecidability of the correspondence 
problem and the following claim. 


Claim . S £ S' iff there does not exist a solution to the PCP for S? 

Proof . <= Suppose that S is not less defined than S'. Clearly, 

for any free interpretation I, S and S' apply functions in 

ff^,..T»f } in the same order, so we only need to check what happens in a 

computation after the application of function c. The only predicates 

involved there are p . n, and p„. These can be viewed as testing 

a ‘ b “5 

if the function to be applied is an a, b, or $, respectively. Both 
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S and S’ are constructed so that each such function variable first 

tests then p^, and then n^. Except where the loop function variable 

U is encountered, both schemes S and S' behave such that if p is 

a 

true then function a is applied; if is false and is true then 

function b is applied; if p^, p^ are both false and is true then 

function $ is applied and the computation terminates. Thus, we can see 

that there can be no free interpretation I such that valj(S) and 

valj(S') are both defined but valj(S) f val^S 1 ) . So, since it is not 

the case that S^s', it is sufficient to consider free interpretations 

I such that valj(S) is defined and valj(S') is undefined. By the 

reasoning above, we can see that this can only occur when the computation 

reaches a point where function variable E (in scheme S') must be replaced 

bv its definition, where p , t>, are false but p^ is true. Since S is 

a b ? 

not less defined than S', S is defined here, and S' is undefined. 

But this is the case where val T (S) = $wcf. ...f, , where w = 

X l 

• t 1 

x, ...x, = v. . . .y . . Thus, i 1 ,...,i is a solution to the PCP for 

1 1 i t ' X 1 t r 


==> Suppose S ES' , Let I be any free interpretation such that 

val T (S) is defined— -hence , val T (S) = val T (S') = $x, . ,.x. cf. ...f . 

1 1 t t 1 

In scheme S', immediately after application of the function c, we' 


have cf . . . . f as the value of the variable, with the string of function 

ML , 

variables a = e(y. )h(y. ).,.h(y ) remaining to be computed according to 

1 1 X 2 ' i t 

the interpretation I. We have four possible cases to consider: 
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1) The interpretation I is such that all function variables in a are 

computed with p true whenever function variables A or X are con- 
's 

sidered (so that function a is applied) , and p & is false but is 

true whenever function variables B or B are considered (so that 
function b is applied). Hence, the variable will eventually have value 
wcf. . ,,f. , where w = v. , ,.v. . and function variable E remains to 

b b b b ■ 

be computed for interpretation I. But for valj(S’) to be defined, we 
must have either p true (where function a is applied) or p false 

”3 SL 

and p^ true (where function b is applied). Hence, val^S') = 

$x. , ,.x, cf. . ,.f. , where y. . . .y . is a proper suffix of x. . »,x., . 

X 1 X t i l X 1 t X 1 t 

Thus, i^,,,,,i is not a solution to the PCP for g 7 . 


2) The interpretation I 
in a is computed with p a 
the value of the variable. 
^ Z 1* Z 2 ,Z 3 G ^ ' suc h that 


is such that some function variable A or 
false and p^ true, so b is applied to 
Since val T (S’) = $x. ...x, cf . . ,,f. , 

1 b b b b 


x. ...X = Z-b2- and y ...y. = z„az„. 

X 1 1 t 1 ■ X 1 J 


Hence, i^,,,.,i is not a solution to the PCP for Sf 7 * 


3) The interpretation I is such that some function variable B or 
in a is computed with p true, so a is applied to the value of the 
variable. .This is just the dual of 2), so again i^...,i t is not a 
solution to the PCP for SP, 


w> >> 
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4) The interpretation I is such that some function variable A, B, 

A- A 

A, or B in a is computed with p^, false and true. All 

function variables (A, B) in a remaining to be computed are replaced 

A A 

by ID until only one function variable (A or B) remains to be computed. 
[Note that S' is constructed so that there is exactly one occurrence of 

A A ■ . 

either A or B in any computation. ] Since we still have p , p, 

3.D 

A A 

false and p^ true, A and B cause function $ to be applied, thus 

ending the computation. Hence val T (S') = $x. . ,.x. cf. ...f, , where 

1 X t X t X 1 

x. , ,.x_, is a proper suffix of y. . , ,y_, . Thus, i-,»..e,i^ is not 
X l t ’ X 1 t 1 C 

a solution to the PCP for Q 

Two other relations between schemes are the following: 

Strong Equivalence . S = S' iff for every (free) interpretation I . either 
both valj(S) and vales') are undefined, or both are defined with 
val^CS) = valj(S'). 

Weak Eouivalence . S = S' iff for every (free) interpretation I either 
valj(S) o_r valj(S') is undefined, or both are defined with val^(S) = 
valj(S') . 

The three relations described, in this paper ( = , £,-) are all 
reasonable relations in the terminology of [5], That is, let S and 


S' be. any two schemes and 'v be any relation between S and S'. 
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Then ^ is reasonable on the class of monadic recursion schemes if for 
any two schemes S and S', 

1) S 5 S' => S ^ S' 

2) S a. S' => S = S' 

We have just shown the undecidability of the inclusion problem for 
monadic recursion schemes. The decidability of the strong equivalence 
problem for schemes remains open, however the construction of the above 
proof gives us another result. 

Corollary . The x^eak equivalence problem for monadic recursion schemes 
is undecidable. 

Proof . The construction is similar to the one above, except now 
define the function variable E in scheme S' as 

E if p then Ta else if 

a 

A 

p, then Tb else if 

b 

then a else U 

$ a 

A scheme is free iff for every free interpretation the computation 
of the scheme has no predicate that ever tests the variable x with the 
same value more than once, Ashcroft, Manna, and Pnueli [1] prove that 
it is decidable x^hether or not a scheme is free. In the proof of the 

main theorem above, scheme S is free, but scheme S' is not free. The 

■ - ; ' V- - 

non-freedom of S’ is introduced by the use of the ID function in the 
definitions of function variables A, R and T. This non-freedom is 


I 




1 


< Page Intentionally Left Blank 



16 


essential to the proof. The strong equivalence problem for free schemes 
is known to be decidable [1], whereas the inclusion problem for free 
schemes is open. 

K.orenjak and ilopcroft [4] define a type of pushdown automaton called 
an s-machine. This is a real-time (no e-moves) deterministic pushdown 
automaton with only one state that accepts by empty store. The inclusion 
problem for languages accepted by s-machines is still open. By appro- 
priate encoding, it can be shown that this problem is equivalent to the 
inclusion problem for free schemes with no ID function. 

Theorem . The weak equivalence problem for free schemes without identity 
is undecidable. 

Proof . The proof technique is similar to that used in the theorem 
above, bet I, h and S be defined as in the proof of the previous 

theorem. We will define a new scheme S" such that S - S" iff there 
does not exist a solution to the PCP for SP> 

S" = where 

V” = {F^,F”,...,F^}U{G 1 G n > U(A,B,X",U) 

<j?' = (fp . . . ,f n ) U (a,b,c,$} 

g>" = (q^.. .,q n > U(p a »P b .P^} 


and Q)" is defined as follows: 
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T?H X. vlli;** 

Jq X i x 


F" *■ if q 1 then h(y 1 )G 1 f 1 else .F" \ Same comments as for scheme S, 

I except the else clause of F" is 
if a „2 t ^ ien h(y 2 )G 1 f 2 else F'^ | now a loop ( 

• 

F" ^ if q then h(y )G 1 f else U 
n n 'n 1 n 


G^ ■** if q^ then h(y^)G^f^ else G 2 
G 2 *■ if q 2 then My^G.^ else G^ 


Same comments as F, for 

, „ I n 

scheme S. 


G *■ if q then h(y )G_f else c 
n n • n 1 n 


A if 

P a 

then 

a 

else 

U 

) 

B «- if 

P a 

then 

U 

else : 

i 

Lf 

> Same comments as for scheme S, 


p b 

then 

b 

else 

U 

* . 

X" + if 

P a 

then 

U 

else 

“ ! 

X" is always the leftmost function 
variable to be computed, so terminate 


p b 

then 

U 

else 

" 1 

> only when p^, p^ false and p^ 
true. 


p $ 

then 

a 

else 

u ) 

1 


U «- U 


| Loop . 


We can see that for any free interpretation I with both val^(S) and 
va.lj.(S") defined, we have valj(S) = $wcf^ »..f^ and valj.(S") = 
awf. ,,»f. , where v = x. . ,.x, = y . . ,,y. . Also, if i ,i 

x t . h ' b h 1 i l t 1 11 

is a solution to the PCP for <jP f then there exists a free interpretation 

I such that val T (S) = $x. ...x, cf. ...f and val T (S") = 

X 1 t X t 1 1. 



18 . 


ay , ,.y cf, ,,,f. . Hence, S = S" iff there does not exist a 
1 1 i t i t 3 1 

solution to the PGP for SP . Thus, the undecidahility of the weak 
equivalence problem for free schemes without ID follows from the 
undecidability of the PCP. Q 

An immediate consequence is: 

Corollary . The weak equivalence problem for free schemes is undecidable. 

There are subclasses of monadic recursion schemes that have a known 
decidable inclusion problem. For example, we can subclassify monadic 
recursion schemes as linear [3] if each term in a function variable 
definition contains at most one function variable. The inclusion and 
weak equivalence problems for linear schemes (not necessarily free) are 
shown to be decidable in [2], 
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